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Global Variational Method for Fingerprint 
Segmentation by Three-part Decomposition 

D.H. Thai* and C. Gottschlich* 


Abstract 

Verifying an identity claim by fingerprint recognition is a common¬ 
place experience for millions of people in their daily life, e.g. for un¬ 
locking a tablet computer or smartphone. The first processing step after 
fingerprint image acquisition is segmentation, i.e. dividing a fingerprint 
image into a foreground region which contains the relevant features for 
the comparison algorithm, and a background region. We propose a novel 
segmentation method by global three-part decomposition (G3PD). Based 
on global variational analysis, the G3PD method decomposes a finger¬ 
print image into cartoon, texture and noise parts. After decomposition, 
the foreground region is obtained from the non-zero coefficients in the 
texture image using morphological processing. The segmentation perfor¬ 
mance of the G3PD method is compared to five state-of-the-art methods 
on a benchmark which comprises manually marked ground truth segmen¬ 
tation for 10560 images. Performance evaluations show that the G3PD 
method consistently outperforms existing methods in terms of segmenta¬ 
tion accuracy. 


1 Introduction 

Fingerprint verification is a widely used authentication method in commercial 
applications and most fingerprint verification systems rely on minutiae for com¬ 
paring two fingerprints. Typical steps of fingerprint image processing [I] include 
segmentation, orientation field estimation [2], image enhancement by contextual 
filtering ma and minutiae extraction. Additionally, many systems include 
nowadays a software-based liveness detection module which can e.g. be based 
on histograms of invariant gradients 0 as a countermeasure against so-called 
spoof attacks. In this paper, we focus on the fingerprint image segmentation step 
and we propose a global three-part decomposition (G3PD) method to achieve 
an accurate extraction of the foreground region. 
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1.1 Global three-part decomposition (G3PD) method 

Our proposed method is based on the paradigm that a fingerprint image can be 
considered as a composition of three components: texture, homogeneous parts 
and small scale objects. The G3PD method aims to decompose a fingerprint 
image into the corresponding three parts: 

• Texture image: By texture we refer to the fact that fingerprint images 
are highly determined by their oriented patterns which have frequencies 
only in a specific band in the Fourier spectrum, see [©. 

• Cartoon image: The homogeneous regions correspond to the lower fre¬ 
quency response. 

• Noise image: Small scale objects staying in the higher frequency band 
are considered as noise, e.g. black dots with random position and intensity. 

For the purpose of fingerprint segmentation, we are only interested in the tex¬ 
ture image as a feature for segmentation. After the decomposition, the cartoon 
and noise images are ignored. Therefore, the decomposition can be considered 
as a feature extraction step which has the goal to estimate the best possible 
texture image for a given input image. Subsequently, the region of interest 
(ROI) is obtained by morphological operations on the non-zero coefficients in 
the extracted texture image, see Figure [l] In order to achieve these goals, we 
propose a model for three-part decomposition with variational based methods 
as described below. The G3PD method follows the same philosophy of texture 
image extraction as the Fourier based FDB method [B] , but regards the problem 
from a different point of view and solves it by a variational approach. 

Proposed variational model for G3PD Decomposition techniques are at 
the core of variational methods. Decomposition is performed by finding the 
solution of a convex minimisation problem. Inspired by this idea, we propose 
a novel model for global three-part decomposition which has five ingredients: 
(1) Cartoon: Piecewise constant regions are measured by the anisotropic total 
variation (TV) norm [7]. (2) Texture: The sparsity of the texture pattern is 
measured by the t\ norm which is well-known to enhance the sparseness of the 
solution. (3) Texture: The smoothness of the texture image is enforced by 
the t\ norm of the curvelet coefficients. (4) Noise: Noise is measured by the 
supremum norm of its curvelet coefficients. (5) Reconstruction constraint: 
Finally, the constraint / = u+v+e ensures that the sum of the three component 
images reconstructs the original image f. Empirically, we have found that 
the curvelets capture the geometry of fingerprint patterns better than classical 
wavelets, see Section [5X4] 

The combination of the decomposition and morphology in our proposed 
G3PD method yields segmentation performance superior to existing segmen¬ 
tation methods. 
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Error = 2.68% 


Figure 1: Overview over the segmentation by the G3PD method: Firstly, the original 
image / is decomposed into cartoon image u, texture image v and noise image e. Sec¬ 
ondly, the texture image v is binarized by separating zero from non-zero coefficients 
and the foreground regoin is obtained by morphological operations. In order to evalu¬ 
ate the segmentation performance, the estimated ROI (second row, second column) is 
compared to manually marked ground truth segmentation (second row, first column). 
Note that the cartoon image u and noise image e contain also texture parts but this 
choice of parameters leads to a better segmentation performance as demonstrated in 
evaluations on a benchmark with 10560 images, see Section [3] 


Performance Evaluation and Comparison to Existing Methods We 

conduct a systematic performance comparison of our proposed G3PD method 
with five state-of-the-art fingerprint segmentation methods. The segmenta¬ 
tion accuracy of all methods is measured on a manually marked ground truth 
database containing 10560 images [B]. A detailed description of the evaluation 
benchmark, training and test protocols, and experimental results is given in 
Section [3j The five methods in the comparison are: a method based on mean 
and variance of grey level intensities and the coherence of gradients as features 
and a neural network as a classifier [5], a method using Gabor filter bank re¬ 
sponses [5] , a Harris corner response based method m, an approach using local 
Fourier analysis PH and the factorized directional bandpass method 0. 

1.2 Related Work 

With more than hundred methods, we refer the reader to [Bj for an overview 
over the literature of fingerprint segmentation methods. For image segmentation 
in general, there is a plethora of approaches to solve this problem. These are 
based e.g. on the intensity of pixels 12i, [13], [14], or the evolution of curves for 
piecewise smooth regions in images HE m, HE [IB] . Texture segmentation, 
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however, is still an open problem, because intensity values are inadequate, e.g. 
for segmenting fingerprint patterns. Methods based on texture descriptors m, 
[20] or finding other meaningful features in an observed image for classification 
have been suggested. 

Based on the classical Rudin-Oslrer-Fatemi (ROF) model [21] . researchers 
have proposed numerous approaches in which the regularisation and fidelity 
terms are considered under different functional spaces, such as Besov, Hilbert 
and Banach spaces [22], [23], [24], [25], [26], [27] . Further image denoising 
approaches use higher-order derivatives instead of total variation for minimi¬ 
sation [2B], [22], mean curvature [3D], Euler’s elastica [3Ii, and total variation 
of the first and second order derivatives [32], and higher-order PDEs for diffu¬ 
sion solved by directional operator splitting schemes [33] . In particular, many 
signals have sparse or nearly-sparse representations in some transform domain 
corresponding to or its regularisation l\ [34] . [35], [36|, EH, m- Aujol and 
Chambolle [391 introduced a model for three-part decomposition which yields 
a texture image v using the G-norm. An improvement of the G3PD model in 
comparison to their work is especially the texture image extraction by enforcing 
smoothness and sparsity on the texture image v. To solve the constrained min¬ 
imisation problems, various techniques have been suggested such as Chambolle’s 
projection [3D] , splitting Bregman method [7], iterative shrinkage/thresholding 
(1ST) algorithms 0T], @2, [13] ■ Wu et al. g3] has proved the equivalence 
between augmented Lagrangian method (ALM), dual methods, and split Breg¬ 
man iteration. We have adopted ALM into our approach to solve the proposed 
constrained minimisation problem. [45], 46] and 47j show that the shrinkage 
operator of multiresolution analysis is the solution of a variational problem when 
considering signals in Besov space, i.e. R“ g , relating to wavelet coefficients. In 
this paper, we focus on the curvelet transform gg, 02i, m , m which is very 
suitable for fingerprint patterns with oriented and curved lines. However, one 
can easily adopt our approach for the shearlet transform [52], the contourlet 
transform [53], or the steerable wavelet transform [54] , 

There are many difficulties relating to the choices of the parameters for 
decomposition and minimisation steps in all aforementioned approaches which 
ensure the convergence of the algorithm and extract enough texture for segmen¬ 
tation under the various situations, such as different illumination, noise, and 
ghost fingerprints (see Figure [2] for an illustration). To solve these problems is 
still a challenge in practice. 


1.3 Setup of the paper 

The organisation of the paper is as follows. In Section [2] we give a detailed de¬ 
scription of the G3PD method in two main steps: first, texture image extraction 
is treated in Section |2.1| followed by morphological operations in Section [2~2 


see Figure [l] To this end, we introduce the G3PD model in Section 2.1.1 which 
defines the objective function as a constrained minimisation problem for the 
decomposition of an image into three parts: cartoon, texture and noise images. 
Next, in Section |2.1.2| we apply the augmented Langrangian method to refor- 
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Figure 2: Typical difficulties for segmentation encountered in fingerprint anal¬ 
ysis. (a) Small scale objects and noise on the sensor, (b) Ghost fingerprint, (c) 
Image with illumination differences. 


rnulate the constrained minimisation into an unconstraint one. Subsequently, 
this unconstrained minimisation problem is solved by the alternating direction 
method of multipliers (ADMM) in Section 2.1.3 The smoothness and sparsity 
of the obtained texture image as a feature for segmentation is discussed in Sec¬ 
tion |2.1.4| In Section |2.2[ we specify how to obtain the ROI from the texture 
image by morphological operations. In Section [3] we describe the evaluation 
benchmark, the training and test protocols, and experimental results. Finally, 
in Section [4] we discuss the results of the evaluation and we give conclusions. 
Additional figures and detailed calculations can be found in [551 . 


2 The G3PD Method for Fingerprint Segmen¬ 
tation 

This section describes the G3PD method which consists of two main parts: in 
the following Section |2.1[ we introduce a model for three-part decomposition 
into cartoon, texture and noise images. Next, we formalize the constrained 
minimisation problem and we discuss the ALM for solving it. In Section [2.2[ we 
utilize the obtained texture image as our feature to perform the segmentation 
by morphological operations. 

2.1 Fingerprint Texture Extraction 

2.1.1 The G3PD Model 

As argued before, the fingerprint / is considered as a composition of a homoge¬ 
neous region it, repeated patterns v staying in a frequency range in the Fourier 
domain and corrupted by certain random noise e. Fundamental for our analysis 
is that we assume that the fingerprint pattern is sparse in the Fourier domain as 
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the ridge lines form an oscillating signal at essentially one frequency, locally. A 
two-dimensional image / : Cl — > M + is specified on the lattice with size Ni x IV 2 : 

Cl = jfc = (fci, k 2 ) € {0, N ± - 1} x {0, N2 1} Cl N 2 |, 

We assume that 


f[k] = u[k] + v[k] + e[k], Vfc £ Cl, 

where f,u,v and e are in matrix form, i.e. f = [m] ken - 

The space B\ ± relating to the l\ norm of the wavelet coefficients (cf. @B]), 
i.e. ||i’|| B i i = is very suitable to measure the smoothness of the 

oscillation signals. However, due to a set of highly curved lines in the fingerprint 
patterns, the i\ norm of curvelet coefficients is considered instead to capture 
their curvature in texture v. Let C{v} = [C.j _/{«}[&]].. ; k ^ x denote the discrete 
curvelet transform of v in i different scales and l orientations at positions k 
contained in the index set I. The C\ norm of its curvelet coefficients is||C{w}|| . 

In order to get the sparse texture v in the spatial domain, the norm is 
adopted. In conclusion, the norms {||C{ , u}|L +||u|| f } are considered to extract 
the fingerprint patterns. Then, the bounded variation space with the discrete 
TV-norm, i.e. J(u) = || \7 d u\\ ti (cf. [551 for the definition of the discrete gradient 
operator V d ), is well-known to measure the roughness of a piecewise constant 
image u pH! . Finally, the residual e is measured by the supremum norm of its 
curvelet coefficients , i.e. 

||C{c}|L = sup |Ci > z{e}[fc]| . 

00 i,l,kex 

Thus, the constraint of the minimisation is defined via the supremum norm 
of the curvelet coefficients of the residual, i.e. ||C{/ — u «}|| £ , being less than 
a threshold S. In summary, the variational solution we advocate for separating 
a fingerprint into texture, cartoon and noise in the Euclidean space X whose 
dimension is given by the size of the lattice Cl, i.e. X = Rl n l, is defined as 

(u,v) = argmin <M|V d w|L +^i||C{u}|| +/r 2 |ML s.t. sup \C it i{f - u - v}[k] I < S 
(■ u,v)ex 2 l 1 qz.fcex J 

^ v- / 

= ||C{/-u-«}||^ 

( 1 ) 

Note that the form of |l]) is analogous to the statistical multiresolution estimator 
in I56j where the nonlinear transformation is the absolute value of the curvelet 
coefficients, i.e. A(-) = |C{-}|, the length of subsets | 6 >| = 1 and the weight- 
function uj s = 1. The main difference is that our model has two variables 
( u,v ). With the residual e = f — u — v, the constrained minimisation ([lj is 
rewritten as 

(u,v,e) = argmin j||V d u||^ + /ii||C{v}||^ + M 2 IMI 4 s -t- ||C{e} ||< S , f = u + v + 

(u,v,e)EX 3 ^ 

( 2 ) 
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Given S > 0, denote G*(|) as the indicator function on the feasible convex set 
S(S) of @, i.e. 


S(S) 


eex I ||C{e} 


too 



and 



0, e G S{6) 
+oo, e e X\S{6). 


By changing the inequality constraint into the indicator function G* is 

rewritten as a convex minimisation of four convex functions and one equality 
constraint: 


(u,v,e) = argmin < || V d u|L + /+ ||C{u}|| + M 2 IML + G* ( | ) s.t. f = u + v + e 

{■u,v,e)£X3 { 1 W 

(3) 

The original image / is therefore decomposed into the piecewise constant image 
u, the texture v and the small scale objects modeling as noise e by minimizing 
the objective function 


2.1.2 Augmented Lagrangian Method to Reformulate the Constrained 
Minimisation Problem in Equation ([3]) 

There are different kinds of norms in §• In order to simplify the calculation, 
we introduce new variables 

Ip = V d u = [p-| ,p 2 ] T 

\ w = [™+]( M)e z = C M- 

Then, (|3| becomes a constrained minimisation and we apply the ALM. Given 
space Y = X x X, the augmented Lagrangian function of ([3]) with the three 
Lagrange multipliers (Ai, A 2 , A 3 ) is defined as 


(it* ,v* ,e* ,w* ,p*) = argmin C(u , v , e , w . p; Ai, A 2 , A 3 ), 

u,v,e,w,pEX 3 xRl 2 -! xY 


(4) 


where 


C(u,v,e,w,p\ Ai, A 2 , A 3 ) = ||p||^ + M 1 IMI+ + p 2 |M |* 1 + G* ^ 


Pi 

2 


p - V d u 


Ai 

~ih 


Pi 

2 


w —C{v} + 


P2 


Pi 

2 


f — u — v — e - 


P 3 


The minimizer of Q is numerically computed through iterations n = 1,2,... 


(i/ n ), u 1 


n) ,e(”),iu( n ),p ( ")' = 


\ (n— 1) \ (n— 1) \(n— 1)\ 

argmm L[u , v , e , w ,p; , A 3 'J 

u,v,e,w,p£X 3 xIRl 2 -! xY 

(5) 
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and the Lagrange multipliers are updated after every step n with a rate 7 and 
the initial values A^ 0) = A^ = Ag 0 "* = 0 : 

U n) = A^ _1) + 7/3i(p (n) - V d «W) 

<A^ n) = + 7 /? 2 (w (n) -C{»W}) 

[aJ” } = A^ _1) + 7/3 3 (/-u(") -«(”) -eW) 

As the number of iterations n goes to infinity, we obtain the true solution of ©• 
However, to reduce the computational time in practice, we stop after a small 
number of iterations. Hence, we gain an approximated solution (cf. Algorithm 

!)• 


Algorithm 1 Augmented Lagrangian method (ALM) for the approximated 
solution of @_ 

Initialisation: = v^ = e® = p^ = w = A ^ = A^ = Ag 0 " 1 = 0 

for n = 1 to N do 

1 . Compute the approximated solution ( y u^ n \v^ n \e^ n \w^ n \p^ n ' > ^ : 


(u( n ),D( n ),e( n ),tr("),p ( ")) = argmin C{u,v,e,w,p; aJ" _ 1) , A^ n_1) , A^* _1) 

u,v ,e,w ,p 

( 6 ) 

2. Update Lagrange multipliers (A^, A^, Ag”)): 


k (n) 


i(n) 


= Ag" -1) + 7 ^(pW-VduW) 
+ 7 — C{v^}) 


i( n ) _ 


(7) 


= + 7 /3 3 (/-w (n) - v (n) -e (n) ) 


end for 


In the following part, we describe the algorithm to solve the minimisation 
problem ([b]) by the alternating direction method of multipliers (ADMM). 

2.1.3 Alternating direction method of multipliers and numerical im¬ 
plementation 

Similarly to 122 , m, m, 153 , m, eqi , this section describes the procedure 
how to solve the minimisation ([ 6 ]) and the method to discretize the solution. 

The solution of § is determined by alternatively minimizing the objective 
function with respect to u while fixing v , e,p, w, and vice versa. Thus, we need 
to solve five subproblems denoted as ” ic-subproblem”, ”p-subproblem”, ”v- 
subproblem”, ” e-subproblem”, ”rt-subproblem” as in Algorithm 2. The iterative 
scheme is as follows 





Algorithm 2 Alternating direction method of multipliers (ADMM) for (6) 


Fix Lagrange multipliers Ai = A^’ 1 \ A 2 = A 2 " ' and A 3 = A 3’ 1 then 
alternatively solve the following sub-problems: 


u-problem”: 

it (n ) = 

argmin C(u, v^-^, e (n_ 1 ) ,p (n_1) , 1 
uex 

A-problem”: 

t?(") = 

argmin C{vS n \ v , w^ 1 

vGX 

e-problem”: 

e(") = 

argmin £(i/"\ iA"\ e, p (n - 1} , it /"” 1 

e£X 

p-problem”: 

p(") = 

argmin C(vP n ' > , v^ n \ e^, p, it/"” 1 ) 
p£Y 

ui-problem”: 

= 

= argmin C{u^ n \v^ n \e^ n \p^ n \w\ 


uGRl 1 ! 


; ‘p-subproblem”: Fix it, v, e, w and 


P eY 


Pi 

2 


p - V d tt 


Pi 


( 8 ) 


12 


Let V d = , c? 2 ~] be the forward gradient operator [39]. The anisotropic 

version of Q is solved by 

p 1 = Shrink (<9+ u — , -j- ] and p 2 = Shrink ( d 2 u — , -j- ) , 


Pi Pi 


where the shrinkage operator is defined as 

x 


pi ’ Pi 


(9) 


Shrink (x , a) := — • max (|x| — a , 0). 
\x\ 


“w-subproblem”: Fix it, v, e, p and 


min <gi to , + 

™eR|zi 1 


P2 


w — C{v} + 


P 2 


( 10 ) 


The solution of (10) at the scale i and the orientation l is 


w it i = Shrink ^C{t>} - ) , i,l€l. 

“^-problem”: Fix it, e,p, w and 


( 11 ) 


min P2\\v\\ ei + 


P2 


w — C{v} + 


Ps 

2 


f — it — v — e 


Ps 


( 12 ) 
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This (12) is solved by 


v = Shrink A , 


M 2 


with 


A = 


fa + fa ) ’ 

C* {faw + A 2 } + /3 3 (/ — u — e + 


fa + fa 


“e-problem”: Fix it, v, p, w and 


min < G* ( t 1 + ^ 


e-(f-u-v +— 


12. 


This (15) is solved by (the proof is similar to [551) 


fa 


e = [ f — u — v + -r- ) — CST [ f — u — v + -r -, S 


fa 


(13) 

(14) 

(15) 

(16) 


with the curvelet soft-thresholding: CST(a;,a) := C*| Shrink(C{x} ,a)|. 
“«-problem”: Fix u,p, e,t u and 


• M 

mm < — 
uGX 2 


T7 I Al 
p- V d u+ — 


fa 

2 


f - u ~ v - € + T 3 


(17) 


Given the discrete finite frequency coordinates u = [wi,W 2 ] £ [—7r, 7r] 2 and 
let F(e^),V(e^), E{e j “) , A 3 (e^) ,Px(e? u ) ,P 2 (e J '“) and A^e^) be the 
discrete Fourier transform of f[k} ,v[k] ,e[k] , A 3 [fc] ,pi[k\ ,p 2 [fc] and Ai[fc], re¬ 
spectively. This @ is solved by 


u = Re 


r 




D (e jw ) 


with 


l/3 3 +4/3 1 [sin 2 (^)+sin 2 (f)] 

A 3 (G“) 


D(e> u ) = fa F(e j “) - V(e j “) - E(e jw ) + 


fa 


-fa 


(1 - e-~) (A(e*0 + Ahp) + (1 - e-~) (P 2 (e>“) + 


The updated Lagrange multiplier A^"\ A^ and A^ in (|7j) are 

a g = a^- 1] + ifa (p^-dtu^Y 
A $ = Ah _1) + 


A?>, = A& 1 ’ + 7&(»5’ -ft,,{«<">}), 


i( n_l ) 

x 2 ,i,l 


\M _ xC™- 1 ) 

A 3 — A 3 


7/?3 (/ 


- - v^ n) - e (n) 
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For a given 7 in Algorithm 1, the solution of ([3| is obtained by applying 
alternatively the above formulas in the subproblems. This is a convex program 
with the alternating minimisation procedure. However, the choice of parameters 
(/Ui, /z 2 , <5) and (/?i, /3 2 , Ps) affects on the solution. Since the texture information 
is an essential feature for the segmentation process, the parameter /ij and /r 2 are 
important, especially /i 2 controls the sparsity of the fingerprint texture. In this 
context, is adaptively designed to cancel (/ 1 2 +/I 3 ) in the shrinkage operator 


(13), it depends only on the maximum of A and the constant C, as follows 


H 2 = C{/3 2 + /3 3 ) • max(A[fc]), 


(18) 


where A[k] is defined in (14). Since the fingerprint images are captured by 
various kinds of sensors, their properties and qualities differ. Therefore, C is 
obtained empirically from training sets for each type of sensor. 

The parameter S is used to remove the small scale objects (noise). In order 
to reduce these kinds of noise in v such that v contains mainly fingerprint 
pattern as good as possible (cf. Figure [l]), we simply approximate these noise as 
Gaussian. See [55] for a combination of multiple noise models, e.g. considering 
Gaussian, Poisson and impulse noise, simultaneously. According to the extreme 
value behavior of the curvelet coefficients (cf. [59]), the threshold S is chosen 
with the quantile a = 0.7 from the asymptotic distribution as 


1 


* /hi—F tT 22-loglog|I| -logTT 

4 = „v/ 2 bI|Z|+„-- and a = -loglog_ a . 


, (19) 


where |X| is total number of curvelet coefficients and a is commonly calcu¬ 
lated from the first level of the Cohen-Daubechies-Feauveau 9/7 wavelet high- 
frequency diagonal coefficient (HH 1 ) (cf. [50]): 

median( \HH1\ ) 
a ~ 0.6745 ' 

Note that this approximation depends on the normality assumption of the noise, 
which may not always be true in practice. This can be adapted to different noise 
models as in general the threshold can always be obtained via simulation. 


2.1.4 Smoothness and Sparsity of the Extracted Texture 

The oscillation signal corresponding to fingerprint patterns is considered as a 
sparse and smooth texture which is decomposed by the G3PD model of the 
original fingerprint image f into three parts satisfying the constraint f = u + 
v + e (cf. Figure |3]), including: the piecewise-constant image u , the texture v , 
and noise e. 

In this section, we will analyze how the norms ||C{u}|| ( , and ||u|| fi in (|3j) 
affect on the smoothness and sparsity of the extracted texture which is our 
main goal for the feature extraction. In order to do that, a closed form of v is 
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(f) U(e^) (g) V(e»’") (h) E(e*") 


Figure 3: A fingerprint image and its Fourier spectrum are shown in (a) and (b), 
respectively. Image (a) is decomposed by G3PD with fii = 1, iteration = 20, level 
= 5, /3i = 0.06 ,^2 = P 3 = 7 = 10 _,i into cartoon (c), texture (d) and noise (e) 
images. Their respective Fourier spectra are visualized in (f-h). We observe that the 
Fourier spectra of the component images resemble responses after lowpass, bandpass 
and liighpass filtering. Please note that especially (d) and (g) show that the fingerprint 
pattern is mostly concentrated in a specific range of frequencies |B]. 


T 1 = ^andT 2 =^. 


found by putting (11) and (141 into (13), letting 9 = and the thresholds 


v = Shrink 9 C* { Shrink 


(CM 


Aa T 

~/V Tl 


!} + (l-0)(/-„-e + ! 


:= ^smooth « CST (v , Ti) 


:= v 


update 


( 20 ) 

We see that the estimated texture v contains two shrinkage operators: respec¬ 
tively, the inside and the outside correspond to the smoothness and sparseness 
terms resulting from ||Cu|| fi and 11^1^ in |3j). (cf. Figure[4]for the effects of the 
smoothness and sparseness of v after different numbers of iterations). These 
effects can be observed in the binarised texture (Figure [4] (f)). The parameter 
9 € (0,1) in ( [20| serves as a regularisation parameter to balance between the 
smoothing term v sm ooth and the updated term ^update- 

Figure |5j compares the effect of the curvelet smoothness term ||C{u} || f in (jlj) 
with a wavelet smoothness term ||yV’{u}|| £ and without smoothness measure¬ 
ment. The texture v estimated using ||W{v}|L in (fll) is not as good regarding 


,T 2 
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Figure 4: Image (a) depicts the original image where the yellow line indicates the 
boundary of the ROI estimated by the G3PD method after 20 iterations. The ROI 
is obtained using morphological operations on the binarized image (f). Images (b-e) 
show Smooth in (|20|>, the smoothing term of v and (g-j) visualize the corresponding 


^update in (201 after k iterations. 


smoothness and sparseness as the texture obtained by the curvelet smoothness 
term. In order to evaluate the convergence rate of the algorithm, we denote the 
relative error between successive iterations as 


Err!"-* = 




j( n ~i) I 


( 21 ) 


In Figure[5] one can see that without smoothness measurement, the convergence 
rate is slow (cf. 3rd row) and the algorithm tends to eliminate texture (cf. 
column 3 and 4). Note that for the smoothness measurement ||C{i;}|| , the 
proposed method achieves a stable estimated texture v after circa 20 iterations. 
Hence, the estimated v and its binarisation are almost the same after 20 or 50 
iterations (see Figure [4] (f) and 1st column of Figure [5]). 


2.2 Morphological Operations 

Firstly, the smooth and sparse texture v is extracted by the combination of the 
i\ norm of its curvelet coefficients and its t\ norms, simultaneously. Secondly, 
post-processing as described in [6] is applied to obtain the ROI. More specifically, 
the morphological operations act only on the non-zero coefficients of the texture 
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image v. In other words, this corresponds to a projection of the thresholding 
value to the parameter /i 2 which has been designed to adapt to the intensity of 
each image by Eq. (18). 


3 Evaluation: Benchmark, Protocol and Exper¬ 
imental Results 


3.1 Benchmark and Evaluation Metric 


The publicly available fingerprint images of the FVC competitions from 2000, 
2002 and 2004 are used as benchmark for evaluating segmentation performance. 
Each competition consists of four databases: three databases are acquired from 
real fingers and the fourth database of each competition is synthetically gener¬ 
ated. 

It has recently been shown that real and synthetic fingerprints can be dis¬ 
criminated with very high accuracy using minutiae histograms (MHs) |6l| . More 
specifically, by computing the MH for a minutiae template and then computing 
the earth mover’s distance (EMD) [52] between the MH of the template and the 
mean MHs for a set of real and synthetic fingerprints. Classification is simply 
performed by choosing the class with the smaller EMD. 

In total, there are 12 databases and each database contains 880 images (80 
for training and 800 for testing). The ground truth segmentation has been 
manually marked for these 10560 images as described in (Bj. 

Let N\ and N2 be the width and height of image f in pixels. Let Mf 
be number of pixels which are marked as foreground by human experts and 
estimated as background by an algorithm (missed/misclassified foreground). 
Let Mb be number of pixels which are marked as background by human experts 
and estimated as foreground by an algorithm (missed/misclassified background). 
The average total error per image is defined as 


Err = 


M f + M b 
N\ x IV 2 


( 22 ) 


3.2 Parameter Selection 


Parameters for all methods considered in the comparison are selected on the 
training set of 80 images for each database. More specifically, those parame¬ 


ters are chosen which minimize the segmentation error defined in (22) for the 


respective training set. Choosing the parameters for each database is appropri¬ 
ate, because the nine databases consisting of real fingerprints have been acquired 
using nine different sensors and the images of each database have sensor-specific 
properties. The parameter selection for the FDB [5], GFB [3], HCR [TO], MVC 
[H] and STFT |TT] methods are discussed in 

For the proposed G3PD method, the involved parameters are summarized 
in Table [I] and the values of the learned parameters are reported in Table [2] In 
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Parameters 

Description 

N 

the number of iterations in the Algorithm 1. 

Mi 

the regularised parameter for i\ norm of curvelet coefficients C{v\ 
in Eq. |2|. 

C 

the adaptive constant in Eq. (18) for the regularised parameter P 2 
in £1 norm of v in Eq. ([ 2 J) . 

Pi j P 2 > P 3 

the parameters in the augmented Lagrangian function (4). 

7 

the rate of the updated Lagrange multipliers in Eq. (41. 

s 

the window size of the block in the postprocessing step in [HI Eq. ( 8 )]. 

t 

a constant for selecting the morphology threshold T in [ 6 J Eq. ( 8 )]. 

b 

the number of the neighbouring blocks in [ 6 ] Eq. ( 8 )]. 

P 

the mirror boundary condition to avoid the boundary effect. 


Table 1: Overview over all parameters for the global three-part decomposition 
(G3PD) method for fingerprint segmentation. Values are reported in Table [5] 


FVC 

DB 

C 


2000 

1 

0.045 

0.0005 


2 

0.045 

0.0100 


3 

0.055 

0.0010 


4 

0.025 

0.0010 

2002 

1 

0.020 

0.0010 


2 

0.035 

0.0005 


3 

0.070 

0.0010 


4 

0.020 

0.0500 

2004 

1 

0.015 

0.1000 


2 

0.025 

0.0010 


3 

0.035 

0.0010 


4 

0.035 

0.0005 


Table 2: Overview over the parameters learned on the training set. The other 
eight parameters are fi\ = 1, /3i = @3 = 7 = 10~ 3 , s = 9, t = 5, 6 = 6 and 
p = 15 for all databases. 
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a reasonable amount of time, a number of conceivable parameter combinations 
were tried on the training set. 

For different numbers of iterations, we have applied the following training 
scheme: 


Firstly, C, an adaptive constant for /i 2 in (181 to define a threshold for 


the sparseness of v , is trained while fixing the other parameters. 

• Secondly, with the obtained C, we train the other parameters one by one 
while fixing the rest. 

The two parameters which have the biggest impact on the segmentation 
performance are the number of iterations N and the constant C in Eq. (18). 


Therefore, these two parameters have been trained first. In our experiments, the 
minimum error on the training set averaged over all 12 databases is obtained 
for N = 4 iterations. In these practical applications of our proposed model, 
stopping before convergence leads to better segmentation results which are also 
influenced by the combination with the morphological operations. For further 
details and a discussion, see [551 . 

Note that the solution of (it, v , e) depends severely on the choices of (/ri , /z 2 , S), 
as well as the parameters of the optimisation step (/3i, /?2 > P3 , 7 ). To achieve a 
good decomposition in which cartoon, texture and noise are separated is difficult 
in practice, because there are no models of noise and texture. Fortunately, this 
paper focuses on the segmentation of fingerprint images for which the texture 
v is important. After the decomposition, there can still be pattern contents in 
the cartoon image u and the noise image e (see Figure [I]), but the important 
aspect is that the texture image v is adequate for segmentation. 

The choice of aforementioned parameters balances the amount of pattern in 
the texture image with the smoothness of the cartoon image. Selecting param¬ 
eters which increase the smoothness of the cartoon image u, also tend to cause 
the halo effect in the texture image v. We observe that especially /3\ influences 
this trade-off: if u contains only homogeneous regions (cf. Figure 1 (c)), it 
tends to generate the halo effect on the boundary of fingerprint pattern in v (cf. 
Figure [3] (d)). Particularly, the halo effect results from the blurred homogeneous 
region u. In order to reduce this effect in v , the parameters are chosen such that 
the algorithm assigns “enough” texture to v. Hence, u and e can contain some 
partial textures, but this yields better a segmentation performance, cf. [53] . 

Let us consider the comparison of the proposed model with the standard 
ROF TV —L 2 model [21. and the TV —Li model [63j for feature decomposition 
(see Figure [7]). For simplicity, let Atvl 2 and AtvLi be the regularisation pa¬ 
rameters for TV — L 2 and TV — L \, respectively. The ROF TV — L 2 model has 
been introduced by pH] for the purpose of image denoising. The ROF model 
has been designed to obtain a smooth cartoon image u. For fingerprint image 
segmentation we are interested in a texture image which is as useful as possible 
in terms of a feature for segmentation. However, the ROF model or the TV — L\ 
model cannot produce a sparse and smooth texture image from a noisy finger¬ 
print image f no matter how the corresponding parameter is selected. On the 
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FVC 

DB 

GFB 0 

HCR nni 

MVC 0 

STFT Tl 

FDB 0 

G3PD 

2000 

1 

13.26 

11.15 

10.01 

16.70 

5.51 

5.69 


2 

10.27 

6.25 

12.31 

8.88 

3.55 

4.10 


3 

10.63 

7.80 

7.45 

6.44 

2.86 

2.68 


4 

5.17 

3.23 

9.74 

7.19 

2.31 

2.06 

2002 

1 

5.07 

3.71 

4.59 

5.49 

2.39 

1.72 


2 

7.76 

5.72 

4.32 

6.27 

2.91 

2.83 


3 

9.60 

4.71 

5.29 

5.13 

3.35 

3.27 


4 

7.67 

6.85 

6.12 

7.70 

4.49 

3.63 

2004 

1 

5.00 

2.26 

2.22 

2.65 

1.40 

0.88 


2 

11.18 

7.54 

8.06 

9.89 

4.90 

4.62 


3 

8.37 

4.96 

3.42 

9.35 

3.14 

2.77 


4 

5.96 

5.15 

4.58 

5.18 

2.79 

2.53 

Avg. 


8.33 

5.78 

6.51 

7.57 

3.30 

3.06 


Table 3: Error rates (average percentage of misclassified pixels averaged over 
800 test images per database) computed using the manually marked ground 
truth segmentation and the estimated segmentation by these methods: a Gabor 
filter bank (GFB) response based method by Shen et al. [5], a Harris corner 
response (HCR) based approach by Wu et al. [TO], a method by Bazen and Gerez 
using local grey-level mean, variance and gradient coherence (MVC) as features 
[8], a method applying short time Fourier transforms (STFT) by Chikkerur et 
al. m , the factorized directional bandpass (FDB) [BJ and the proposed method 
based on the G3PD model. 


one hand, if the ROF model decomposes / into a very smooth cartoon image 
u, than v contains both noise and texture. On the other hand, for a differ¬ 
ent choice of Atvl 2 or AtvXh v contains mostly noise and u includes texture 
and large scale objects. In neither of the two situations, u or v is useful as a 
feature for fingerprint segmentation. A comparison of the G3PD method with 
T Y — Li and TV —L 2 two-part decomposition is shown in Figure]?] Zhang et al. 
[Ml have tried to solve this problem by proposing a locally adaptive two-part 
decomposition which also takes the orientation of the pattern into account. 

In summary, the proposed G3PD method yields a satisfactory performance 
judged by visual inspection (see Figure [6] for one example from each database) 
and it outperforms the other methods on ten of twelve databases, see Table [3] 
This demonstrates the robustness of the G3PD method for fingerprint segmen¬ 
tation. 


4 Conclusions 

We have presented a global framework for the fingerprint segmentation problem 
which is to separate the foreground from background based on texture analysis. 
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We have proposed the G3PD method for three-part decomposition of fingerprint 
images. The texture pattern is analyzed under the variational approach consid¬ 
ering sparsity and smoothness at the same time: with the £i-norm for sparsity 
and td-norm of curvelet coefficients for smoothness. The resulting texture image 
is binarised and postprocessed by morphology to obtain the region of interest. 

We have proposed a model for three-part decomposition which takes the na¬ 
ture of the texture occurring in real fingerprint images into account. Fingerprint 
images are characterized by a smooth, curved and oriented pattern which has a 
sparse representation in certain transform domains. 

The G3PD method is somewhat similar in spirit to the FDB method [6] 
which also takes into account the specific properties of fingerprint patterns. 
Frequencies occurring in real fingerprints are mostly located in a specific range 
in the Fourier domain and the corresponding texture is extracted by an elaborate 
bandpass filtering process involving forward prediction, proximity operator and 
backward projection. Similarly, the three-part decomposition can be regarded 
as lowpass, bandpass and highpass filtering of signals corresponding to it, v and 
e, respectively (see images (f-h) in Figure [3]). This illustrates the connection 
between classical bandpass filtering in the Fourier domain and the variational 
approach. 

In conclusion, we have performed an extensive comparison of the G3PD 
method with five state-of-the-art fingerprint segmentation algorithms on a large 
benchmark with a variety of different challenges and have found that the G3PD 
method outperforms its competitors on ten out of twelve database in terms of 
segmentation accuracy. 

We believe that this work paves the way for further research in areas such as 
latent fingerprint segmentation in which we deal additionally with other kinds of 
noise like large scale structure noise, or to better deal with the few low-quality 
examples which still pose problems to the method. We believe that further 
improvements can be achieved by combining the G3PD method with additional 
features, e.g. the texture image obtained by the FDB method. 


Data Availability Statement 

Matlab Implementation of the G3PD Method for Fingerprint Segmentation 
http://dx.doi.org/10.6084/m9.figshare.1418020 

Benchmark for Fingerprint Segmentation Performance Evaluation 

http://dx.doi.org/10.6084/m9.figshare.1294209 

Matlab Implementation of the FDB Method for Fingerprint Segmentation 

http://dx.doi.org/10.6084/m9.figshare.1294210 

FVC databases 

http://bias.csr.unibo.it/fvc2000/ 
http://bias.csr.unibo.it/fvc2002/ 
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Figure 5: The following comparison illustrates the effect of the smoothness term for v 
in ill. The first column depicts the h norm of curvelet coefficients, i.e. ||C{u}||^ and 
the second column the l\ norm of wavelet coefficients, i.e. 11 W{v} 11 ( after 50 iterations. 
Columns 3 and 4 visualize v obtained without smoothness term (no ||C{v}||^ in (JTJ) ) 
after 20 and 50 iterations, respectively. The first row shows the texture images v , 
the second row their binarized versions and the third row their plots of convergence 
rates. The comparison shows that the curvelet based smoothness term leads to a better 
texture image than the wavelet based one and that convergence without smoothness 
term is slow and texture tends to be destroyed. 
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Figure 6: Segmented fingerprint images and the corresponding texture images 
by the variational method for FVC2000 (first and second row), FVC2002 (third 
and fourth row) and FVC2004 (fifth and sixth row). Columns f.l.t.r correspond 
to DB1 to DB4. 
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(a) (b) G3PD: /3i = (c) 

0.005 




Figure 7: A comparison of G3PD with TV — L 2 and TV — L\: First row: images 
f.l.t.r are the original image f and the three-part decomposition by G3PD with N = 
50,/3i = 0.005 (see Table [ 2 ] for the other parameters): the cartoon image u , texture 
image v and noise image e. The first and second column of rows two to four show 
images u and v , respectively, for TV — L 2 two-part decomposition with different values 
of Atvl 2 - The third and fourth column show the corresponding images u and v for 
TV — L 1 two-part decomposition. The number of iterations for TV — L 2 and TV — L\ 
is N = 350. Note that for no choice of Atv_l 2 or Atvlu TV — L 2 or TV —Li produce a 
good feature image for segmentation of this noisy fingerprint, while the G3PD model 
provides a useful texture image v for the segmentation procedure. 
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